Introduction
The “CASIA-Tencent Chinese Traffic Sign Understanding Dataset” (CTSU Dataset) was built by the National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences (CASIA), and T Lab, Tencent Map, Tencent Technology (Beijing) Co., Ltd. The images were collected from in-vehicle camera videos recorded in Chinese cities. The covered areas include not only developed areas such as urban streets, urban expressways, and highways, but also underdeveloped areas such as rural roads. The CTSU Dataset contains 5000 traffic signs annotated with 16463 descriptions, 31536 relationship instances, and 43722 components (including 18280 texts), providing abundant annotations for traffic sign understanding. All images are divided into a training set of 4000 images and a test set of 1000 images with the same category distribution.
CASIA-Tencent CTSU.zip (232.6MB)
Annotations
1. Sign Categories: All traffic signs are manually classified into 13 categories. The names and numbers of these categories are shown in Table.1.
Table.1
id | category | number
1 | Guidance information | 1100
2 | Lane information | 900
3 | Direction information | 1000
4 | Service area | 420
5 | One-way lane | 50
6 | Turn around | 50
7 | Underground tunnel | 50
8 | Bicycle lane | 250
9 | Bus lane | 600
10 | Yield to peds | 100
11 | Emergency lane | 200
12 | Speed measurement | 100
13 | Roadside sign | 180
The category distribution of our dataset is imbalanced. Users may apply data augmentation methods such as color jittering, slight angle rotation, and image distortion to alleviate this; a minimal augmentation sketch is given below.
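As an illustration only, the following sketch applies such augmentations with PyTorch/torchvision; the transform parameters and the file name are illustrative assumptions, not values prescribed by the dataset.

```python
# Minimal augmentation sketch for under-represented categories.
# Assumes PyTorch/torchvision is installed; parameter values and the
# file name "example_sign.jpg" are illustrative, not prescribed by CTSU.
from torchvision import transforms
from PIL import Image

augment = transforms.Compose([
    transforms.ColorJitter(brightness=0.3, contrast=0.3, saturation=0.3, hue=0.05),
    transforms.RandomRotation(degrees=5),                        # slight angle rotation
    transforms.RandomPerspective(distortion_scale=0.2, p=0.5),   # mild image distortion
])

img = Image.open("example_sign.jpg").convert("RGB")
aug_img = augment(img)
aug_img.save("example_sign_aug.jpg")
```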
2. Components: Components of traffic signs include texts and symbols. We further divide all components into three groups: texts, symbols, and arrowheads. All the components and their class ids are listed in Table.2.
Table.2
3. Relations: Relations defined in the CTSU Dataset are of two types: association relations and pointing relations.
4. Semantic Descriptions: Semantic descriptions take the form <key: value>, where each item in the value is a specific place name or road name and the items are unordered.
5. Information Matching Metric: Our IM metric is available in the script eval_im.zip (4KB); an illustrative, unofficial sketch of order-insensitive description matching is given after this list.
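For intuition only, the snippet below shows one way to compare predicted and ground-truth <key: value> descriptions while ignoring the order of the values. This is a hypothetical sketch, not the official IM metric; use eval_im.zip for evaluation. The example place names are made up.

```python
# Hypothetical order-insensitive matching of <key: value> descriptions.
# NOT the official IM metric -- use eval_im.zip for actual evaluation.

def match_descriptions(pred, gold):
    """Compare two {key: list of place/road names} dicts, ignoring value order."""
    correct = total_pred = total_gold = 0
    for key in set(pred) | set(gold):
        pred_vals = set(pred.get(key, []))
        gold_vals = set(gold.get(key, []))
        correct += len(pred_vals & gold_vals)
        total_pred += len(pred_vals)
        total_gold += len(gold_vals)
    precision = correct / total_pred if total_pred else 0.0
    recall = correct / total_gold if total_gold else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

# Example: the value order does not matter.
gold = {"direction": ["Zhongguancun", "Haidian Bridge"]}
pred = {"direction": ["Haidian Bridge", "Zhongguancun"]}
print(match_descriptions(pred, gold))  # (1.0, 1.0, 1.0)
```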
Dataset Format
Annotation files are organized in JSON format. Each image and its corresponding annotation file share the same name. The format of the annotation file is as follows:
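The exact schema is defined by the released JSON files and is not reproduced on this page; the snippet below is only a hypothetical loading sketch based on the naming convention above, with the expected kinds of content noted in comments.

```python
# Hypothetical loader sketch; consult the released JSON files for the actual schema.
import json
from pathlib import Path

def load_annotation(image_path):
    """Load the JSON annotation that shares its name with the image."""
    ann_path = Path(image_path).with_suffix(".json")
    with open(ann_path, "r", encoding="utf-8") as f:
        return json.load(f)

ann = load_annotation("example_sign.jpg")  # hypothetical file name
# Per the Annotations section, each annotation is expected to cover:
# the sign category, the components (texts / symbols / arrowheads),
# the relations (association / pointing), and the <key: value> descriptions.
```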
Performances of our methods
The performances of our methods in papers [1] and [2] are shown in Table.3. Notably, because entity category annotations have been newly added to the semantic descriptions, the information matching results differ from those reported in paper [1].
Table.3
[1] Guo Y, Feng W, Yin F, et al. Learning to understand traffic signs[C]//Proceedings of the ACM International Conference on Multimedia (ACM MM), 2021: 2076-2084.
[2] Guo Y, Feng W, Yin F, et al. SignParser: An End-to-End Traffic Sign Understanding Framework[J]. International Journal of Computer Vision (IJCV), 2023: 1-17.
Condition of Use
Reference
The CTSU Dataset was first used in the following research work:
Yunfei Guo, Wei Feng, Fei Yin, Tao Xue, Shuqi Mei and Cheng-Lin Liu. Learning to Understand Traffic Signs[C]//Proceedings of the ACM International Conference on Multimedia (ACM MM), 2021: 2076-2084.
Contact
Cheng-Lin Liu (liucl@nlpr.ia.ac.cn), Fei Yin (fyin@nlpr.ia.ac.cn)
National Laboratory of Pattern Recognition (NLPR)
Institute of Automation, Chinese Academy of Sciences
95 Zhongguancun East Road, Beijing 100190, P.R. China
Phone: (+86-10) 8254-4797
Fax: (+86-10) 8254-4594
Email: liucl@nlpr.ia.ac.cn
Website: www.nlpr.ia.ac.cn/pal/